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Statement as to Rights to Inventions Made Under 
Federalfy^ponsored Research and Development 

Part of the work performed during development of this invention utilized 
U.S. Government funds. The V.S. Govonment has certain tig^s in this invention. 

Field of the Invendon 

The invention rdates to the field of gaie expression and gene therapy, and 
to novel vectors for these uses. In particular, the invention relates to the 
development and use of an artifidal or synthetic chromosome as a vector for gene 
expression and gene therapy, especially in humans. The invention enables the 
controlled construction of stable sjTithetic or artificial chromosomes fi-om isolated 
.purified DNA. With this DNA, a fimctional chromosome is &nned in a cdH and 
maintained as an estradiromosomal elane»t The artificial chromosome performs 
&e essential diromosomal fiinctions of naturally-occurring chromosomes so as to 
permit the chromosome to fiinction as an effective vector for gene therapy when 
therq)eutic DNA is included in the chromosonie. 

Background of the Invention 

The genetic manipulation of cdls lumed at correcting inherited or acquired 
disease is refared to as gene therapy. Until now, most clinical studies m this field 
have focused on the use of vu^ gene therapy vectors. Based on the results of 
these studies, it is becoming clear that current viral gene therapy vectors have 
severe climcal limitations. These include immimogemdty, cytopathicity, 
inconsistent gene expression, and liriiitations on the size of the therapeutic gene. 
For these reasons, much attention has been recentiy focused on the use of 
hon-vkal gene therapy vectors. 



Ih particular, synthetic mammalian chromosomes would be usefiil vectors 
for facilitating a variety of genetic manipulations to living cells. The advantages 
of synthetic mammalian chromosomes include hi^ nutotic stability, consistent and 
regulated gene expression, high cloning capacity, and non-immunogenicity. 

Artificial chromosomes were first constructed in S. cerevisiae in 1983 
(Murray ei al. Nature 505:189-193 (1983), and in S. pombe in 1989 
(Hahnenbetger ef Proc. Natl Acad. Set USA 55:577-581 (1989). For many 
reasons, however, it has not been obvious whether similar vectors could be made 
in mammalian cells. 

First, multicellular organisms (and thus the progenitors of mammalian 
cells) div^ged from yeast over 1 billion years ago. Although there are similarities 
among living organisms, in general, liie similarities among two organisms are 
invrasely related to the extent of thdr evolutionary divergence. Clearly, yeast, a 
umcellular organism, is radically difiermt biologically from a complex multicellular 
vertebrate. 

Second, yeast chromosomes are sev«^ orders of magnitude smaller than 
mammalian chromosomes. In S. cerevisiae and S, pombe, the chromosomes are 
0.2 to 2 megabases and 3.5-5.5 megabases in l^gth, respectively. In contrast, 
mammalian chromosomes range in size from approximately 50 megabases to 250 
megabases. Since there is a significant difference in size, it is not clear, a priori, 
vi)xs^Sasx constructs comparable to yeast artifidal chromosomes can be constructed 
and transected into mammalian cells. 

Tlurd, yeast chromosomes are less condensed tiian mammaHan 
chromosomes. This implies that mammalian chromosomes rely on more complex 
chromatin interactions in order to achieve this higher level of structure. The 
complex structure (both DNA structure and higher order chromatin structure) of 
mammalian chromosomes calls into question whether artificial chromosomes can 
be created in mammalian cells. 



Fourth, yeast centromeres are far less complex than mammalian 
centromeres. In^: cerevisiae, for example, the centromere is made up of a 125 bp 
sequence. In S. pombe, the centromere consists of approximately 2 to 3 copies 
of a 14 kb sequence element and an inverted repeat separated by a core region 
(-7 kb). In contrast, human centromeres are made up of several hundred 
kilobases to several megabases of highly repetitive alpha satellite DNA. 
Furthermore, in mammalian centromeres, there is no evidence for a central core 
region or inverted repeats such as those found in S. pombe. Thus, unlike yeast 
centromeres, mammalian centromeres are extremely large and repetitive. 

Fifth, yeast centromeres have far fewer spindle attachments than 
mammalian centromeres (Bloom, Cell 75:621-624 (1993)). S. cerevisiae, for 
example, has a single microtubule attadied to the centromere. In S. Pombe, there 
are 2-4 microtubules attached per centromere. In humans, on the other hand, 
there are several dozen microtubules attached to the centromere of each 
chromosome (Bloom, Cell 75:621-624 (1993)). This further illustrates the 
complexity of mammalian centromeres compared to yeast centromeres. 

Together, these differences are agnificant and do not suggest that a result 
in yeast can be reasonably expected to be transferable to mammals. 

Normal mammalian chromosomes are comprised of a continuous linear 
strand of DNA lan^ng in size from approximately 50 to 250 megabases, Li order 
for these genetic units to be faithfully repUcated and segregated at each cell 
division, it is believed that they must contain at least three types of functional 
elements: tdomeres, origins of replication, and centromeres. 

Telomeres in mammals are composed of the repeating sequence 
(TTAGGG)„ and are thought to be necessary for replication and stabilization of 
the chromosome ends. Origins of replication are necessary for the efficient and 
controlled replication of the chromosome DNA during S phase of the cell cycle. 
Although mammalian ori^ns of replication have not been well-characterized at the 
sequence level, it is believed that they are relatively abundant in mammalian DNA. 
Fmally, centromeres are necessary for the segregation of individual chromatids to 



the two daughter cells during mitosis to ensure that each daughter cell receives 
one, and only one, copy of each chromosome. Like origins of replication, 
centromeres have not been defined at the sequence level. Alpha satellite DNA 
may be an important centromeric component (HasfetaL, Cell 70:68 1-696 (1992); 
Larin et aL, Hum. Mol. Genet. 5:689-695 (1994); Willard, Trends in Genet. 
<?:410-415 (1990)). But there are cases of mitotically stable abnormal 
chromosome derivatives that apparently lack alpha satellite DNA (Callen et aL, 
Am. J. Med Genet ¥5:709-715 (1992); CroUa etal.,J. Med Genet. 2P:699-703 
(1992); Voullairee^a/., Am. J. Hum. Genet. 52:1153-1163 (1993); Blennowe/ 
al. Am. J. Hum. Genet. 54:%11-'&S3 (1994); Ohashi etal. Am. J. Hum. Genet. 
55:1202-1208 (1994)). Thus, at this time, the composition of the mammalian 
centromere remains pooriy understood. 

While others have clmmed to have produced "artificial" chromosomes in 
mammalian cells, no one has ever produced an artificial chromosome that contains 
only exogenous DNA. In each of these previous cases, the investigators either 
modified an ejdsting chromosome to make it smaller (the "pare-down" approach) 
or they int^rated exogenous DNA into an existing chromosome vMch. then broke 
to produce a chromosome fiiagment contmrang endogenous sequences fi-om the 
preexisting chromosome (the "fiagmentation" approach). Li the present invention, 
exogenous DNA sequences are introduced into human cells and form stable 
synthetic chromosomes without integration into endogenous chromosomes. 

Among the pare-down approaches^ three specific strate^es have been 
used: (1) telomere directed truncation via illegitimate recombination (Bamett, 
M.A. et al. Nucleic Acids Res. 21-21-36 (1993); Farr, C.J. et al, EMBO J. 
7^:5444-54 (1995)) (2) alpha satellite targeted telomere insertion/truncation via 
homologous recombination (Brown, K.E. et al. Hum Mol. Genet. 3:1227-37 
(1994)) (3) formation/breakage of dicentric chromosomes (Hadlaczky, G., 
Mammalian Artificial Chromosomes, U.S. Patent 5,288,625 (1994)). 



Bamette/a/. {Nucleic Acids Res. 27:27-36 (1993)), Fair efa/. (EMBO J. 
I4:S4/^/^.S/[ (1995)), and Brown et al (jSumMol Genet. 3:1227-37 (1994)) 
describe methods for fragmenting endogenous chromosomes by transfecting 
telomeric DNA and a selectable marker into mammalian cells. In each case, a 
truncated chromosome was created that was smaller than the original 
chromosome. The resulting truncated chromosomes contained Iaig;e amounts of 
oidogenous diromosome sequence, including the endogenous centromere. Thus, 
these chromosomes were not formed de novo. 

Hadlaczlq^ (Mammalian Artificial Chromosomes, U.S. Patent 5,288,625 
(1994)) describes a cell-line that can be use to propagate a chromosome that was 
formed as a result of a dicentric chromosome breakage event. All of the 
sequences, with the exception of a selectable marker were derived from the 
original. My frinctional dicentric chromosome. Thus, these so called "artifidal" 
chromosomes were not created de novo. 

Among the "fragmentation" approaches, Haaf et al. (Cell 70:681-696 
(1992)) and Praznovszky et al. (Proc. Natl Acad Sci. USA 55:11042-11046 
(1991)) describe methods for producing chromosome fragments by integrating 
trans&cted DNA into raidogenous chromosomes. Following transfection, the 
integrated DNA sequences become amplified (increase in copy number), and in 
some clones, a portion of the endogenous chromosome breaks off to produce a 
fragment that exists extrachromosomally. In both references, integrated 
transfected DNA can be found Pensively on the endogenous chromosome and 
the extrachromosomal fragment. 

In the experiments by Haaf al (Cell 70:681-696 (1992)), human alpha 
satellite DNA and the neomycin resistance gene were co-transfected into African 
Green Monkey cells. No other exogenous DNA was included in any of the 
transfections. In every transfection clone, DNA was found to be integrated into 
the aidogenous chromosomes. In one clone, which was also found to contain an 
extrachromosomal fragment, the transfected alpha satellite DNA had amplified 
extaidvely following integration. The authors conclude, based on Southern blot 



and Fluorescence In-Situ Hybridization, that AlSican Green Monkey sequences co- 
amplified with the transfected DNA and were interspersed among the alpha 
satdlite DNA. In further characterization of the chromosomes that contained 
amplified alpha satellite, it was found that "the number, size, and chromosomal 
location (telomeric, interstitial, or centromeric) of the transfected chromosome 
regions varied fi-om cell to cell within the population of line 3-3 1 cells, suggesting 
instability of the transfected sequences." Finally, analyas of the mitotic behavior 
of the chromosomes containing amplified alpha satellite DNA revealed a high 
incidence of anaphase bridges, suggesting that the chromosomes were dicentric 
(or multicentric). Thus, the high degree of observed structural instability in 
conjunction \wth the high incidence of anaphase bridge structures is consistent 
with the idea that the chromosome fiagment resulted fi-om an int^iation/ 
amplification/breakage event. Finally, it is also worth noting that in clones that 
contained integrated, unamplified alpha satellite DNA, no extrachromosomal 
firagments were obsaved, fiirther suggesting that amplification is important for the 
chromosome fi-agmentation process in this method. 

Praznovszkye/a/. (Proc. Natl Acad. Sci. USA 5<?: 11042-1 1046 (1991)) 
produced chromoson^ fiagm«its by integrating a piece of non-centromeric human 
DNA (lata- shown to map to human chromosome 9 qter by McGill et al. (Hum. 
Mol Genet 7:749-751 (1992)) and Cooper etal. (Hum. Mol. Genet. inSI-lSA 
(1992)) into an endogenous chromosome. Like the Haaf experiment, the 
integrated transfected DNA amplified extensively, and was found to be 
intersp«-sed wth mouse genomic sequences. The authors suggest that the 
integration/amplification of the transfected DNA resulted in the formation of a 
dicentric chromosome that then subsequently broke to produce chromosome 
fi-agments. Analysis of the chromosome fi-agments shows unambiguously that the 
chromosome fi-agments were derived from the mouse chromosome containing the 
integrated amplified DNA. 



There are a number of important similarities between the experiments by 
Haaf et. al and Praznovszky et. al First, both show that the transfected DNA 
integrated into endogenous chromosomes. Second, both show that following 
integration, the transfected DNA amplified extensively. Third, endogenous DNA 
(untransfected chromosomal sequences from the recipient cell ) was found to be 
interspersed throughout the amplified sequences. Fourth, the endogenous 
chromosomes containing the amplified transfected sequences stained with CREST 
antisera. Fiflh, the oidogenous diromosomes containing the amplified transfected 
sequences behaved similariy to dicentric chromosomes during mitosis. Finally, the 
endogenous chromosomes containing the amplified transfected sequences 
displayed structural instability. Thus, the large number of important similarities 
and the demonstrated chromosomal fragmentation by Praznovszky et, al. indicate 
a chromosome integration/amplificatioa1)reakage mechanism in both of these 
experiments. 

Further evidence that transfection and integration of alpha satellite DNA 
into mammalian chromosomes is not suflBcient to create extrachromosomal 
fragments in the absence of amplification was obtained by Larin et al. {Hum. 
MoL Genet 5:689-95 (1994)). In these e^eriments, alpha satellite DNA linked 
to a selectable maricer was transfected into human cells. In every drug-resistant 
clone, the alpha satellite DNA was integrated into an endogenous chromosome. 
While these integrations formed centromere-like structures (i.e. primary 
constrictions, CREST antisera staining, and lag^g chromosomes during 
anaphase), no extrachromosomal fi-agments were observed in any clone. Since 
these experiments felled to provide clones with chromosomes containing the 
transfected alpha satellite DNA and not an endogenous centromere, there is no 
reliable method to determine whether the centromere-like structures that fiirmed 
are capable of facilitating chromosome segregation. 



Since each of the "pared-down" chromosomes was created from a pre- 
existing chromosome and smce each of the "fragmentation" chromosomes was 
CTeated by integrating DNA into pre-existing chromosomes, these references do 
not provide gddance about how to create chromosomes dz novo from transfected 
naked DNA. 

Rjrthennore, these chromosomes and the approaches used to make them 
have severe limitations as gene therapy vectors for several reasons. First, the 
methods used to make them can only be used to create the chromosomes in cell 
culture. Since the breakage events are either extremely rare and/or produce 
diromosomes vwth unpredictable structure, these methods are not compatible with 
direct use in patients' cells. Additionally, the instability of the amplified sequences 
in tl^ fragmentation approach is inconsistent with use in patients due to the risks 
of genomic rearrangements that, in turn, may lead to cellular transformation and 
cancer. 

It would be highly desirable, therefore, if there were a prefabricated 
chromosome vector with defined structure that could be introduced directly into 
patient's cells, especially a vector that did not depend upon mtegration into 
endogenous chromosomes or subsequent amplification, and w^ere the structure 
of tiie construct in the cell is substantially identical to its structure prior to 
transfectioa 

Second, pared-down chromosomes and chromosome fragments are 
composed of undefined endogenous sequences and provide no guidance for 
identifying sequences that are fimctionally important. 

It would be highly dearable, therefore, to provide vectors composed of 
defined sequences and the methods to produce these defined synthetic 
chromosomes that allow other fimctionally important sequences to be rapidly 
identified. 



Third, the chromosomes produced by the pare-down and fragmentation 
approaches can not be substantially purified using currently avmlable techniques. 
Thus, it is difficult to deliver these pared-down chromosomes to mammalian cells 
without delivering other mammalian chromosomes. 

It would be highly desirable, therefore, to pro\dde substantially purified 
genetically engine^d DNA that can be introduced into a cell and form a 
fimctional chromosome. 

Fourth, since these pared-down chromosomes and chromosome fragments 
have never been isolated as naked DNA and reintroduced into a cell, up to the 
present, it was never clear xdiether any exogenous DNA could be introduced into 
a cell to produce a fiinctional chromosome de novo (without integrating into the 
host chromosomes first). 

It would be highly desirable, therefiare, to provide artifidal mammalian 
chromosomes that are created de novo by introducing purified DNA into a 
mammalian cell. 

Finally, it is very difficult to add new DNA sequences (e.g. therapeutic 
genes) to the pared-down chromosomes and chromosome fragments. 

It would be Mghly dearable, therefi)re, to provide vectors created in vitro, 
where pladng new DNA sequences onto the vectors is straight-forward and 
efficient. 

Sun et at. (Nature Genetics 8:33-41 (1994)) describe a viral-based vector 
system designed for use in himian cells. The vector is described as a "human 
artificial episomal chromosome." However, the vector rdies on the presence of 
EBNA-1, a toxic and immunogemc viral protein. Further, the vector relies on a 
viral origin of replication and not on a natural mammalian chromosomal 
replication origin. Further, the "chromosome" does not contain fiinctional 
centromeric or tdomeric DNA, and does not form a fiinctional Idnetochore during 
nutosis. As a result, such a vector does not segregate in a controlled manner. 
Finally, the vector is present in the cell at an elevated copy number that ranges 
from 50 to 100 copies per cell, unlike endogenous chromosomes. Based on these 
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criteria for defining mammalian chromosomes, this vector cannot be properly 
designated a "human artifidal chromosome" because it has different properties and 
fimctions by unrelated mechanisms. 

Thus, there is still a clear need for a wholly synthetic or artificial 
chromosome made fi^om DNA that can be manipulated in vitro and, upon 
transfection into cells, will adopt a functional chromosome structure and will 
direct gene expression in a controlled manner. 

Li contrast to the cited art, several embodiments of the current invention 
describe a prefabricated chromosome vector vnth defined structure and 
composition that can be introduced directly into patients' cells. Since the vector 
described in this invention does not depend upon integration into endogenous 
chromosomes or aibsequent amplification, &e structure of the construct in the cell 
is substantijdly identical to its structure prior to fransfection. 

In contrast to the cited art, the vectors described in this present invention 
are composed of defined sequences. Furthermore, the methods used to produce 
these synthetic chromosomes allow other fimctionally important sequences to be 
rapidly identified. 

In contrast to the cited art, \wth the present invention, the inventors 
demonstrate for the first time that artifidal mammalian chromosomes can be 
o^ted de novo by introdudng purified DNA into a mammalian cell. 

In contrast to the cited art, since the vectors described in the present 
invention are created in vitro, placing new DNA sequences onto the vector is 
straight-forward and efficient. 



Summary of the Invention 



The inventors have developed methods for producing large quantities of 
purified intact alpha satellite arrays of up to 736 kb in length. By transfecting 
these arrays into human cells along with telomeric DNA and human genomic DNA 
sequences, several wholly synthetic human chromosomes that exhibit a Wgh 
degree of mitotic stability in the absence of selection have been produced. 

Unlike previous approaches whereby attempts were made to produce an 
artifidal mammalian chromosome, this approach does not rely on the modification 
of existing endogenous chromosomes. Furthermore, it does not produce multiple 
integration events within the endogenous chromosomes. These chromosomes 
were formed and maintained extrachromosomally, so integration into an 
endogenous chromosome is avoided. 

The relatively high frequency of synthetic chromosome formation and the 
lack of other genomic rearrangements assodated vAth the chromosome formation, 
allows the synthetic chromosomes made by the inventors to be used as effective 
vectors for heterologous gene expression and gene therapy. 

The invention is thus based on the inventors' discovery that by means of 
isolated purified DNA alone, a synthetic or artificial chromosome is produced de 
novo (firom purified DNA) in a cell and is produced and maintained as an 
extrachromosomal element. This chromosome retains the essential fimctions of 
a natural mammalian chromosome in that it is stably maintamed as a non- 
integrated construct in (fividing mammalian cells without selective pressure, just 
as naturally-occurring chromosomes are inherited. For a linear chromosome, this 
indicates centromeric, telomeric, and origin of replication fimctions. 

The invention is thus directed to a synthetic or artifidal mammalian 
chromosome. The chromosome is product fi-om isolated purified DNA. The 
isolated purified DNA is transfected into mammalian cells. Without integrating 
into an endogenous chromosome, it forms a fimctional chromosome. This 
chromosome is not derived from an aidogenous naturally-occurring chromosome 
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m situ. The starting material is isolated purified centromeric DNA and DNA that 
allows chromosome formation without integration. For linear chromosomes, 
telomeric DNA is included. In a preferred embodiment, the DNA that allows 
chromosome formation without integration is genomic DNA (from the naturally- 
occurring genome of an organism). 

The artificial mammalian linear chromosome thus preferably essentially 
comprises centromeric, telomeric, and genomic DNA. In one embodiment, the 
artificial chromosome is a drcular chromosome. In this case, telomeric DNA is 
absent since it is not necessary to replicate chromosome ends. 

The g^iomic DNA is a subgenomic DNA fragment that is a restriction 
enzyme digestion Augment, a Augment produced by mechanical shearing of 
gaiomic DNA, or a s^mthetic fi:agment synthesized in vitro. The genomic DNA 
starting material that is transfected) can be a mixture of heterogeneous 
fragments (e.g., a restriction digest) or can be a cloned fragment or fragments 
(homogeneous). 

Centromeric DNA comprises a DNA that directs or supports kinetechore 
formation and thereby enables proper chromosome segregation. Centromeric 
DNA at active, fimctional, caitromeres is assodated with CENP-E during mitosis, 
as demonstrated by immunofluorescence or immunoelectron microscopy. By 
"associated" is meant that the centromeric DNA and CENP-E co-localize by 
fluorescence in situ hybridization (FISH) and hnmunofluorescence. 

Telomeric DNA comprises tandem repeats of TTAGGG that provide 
telomere fimction, i.e., replicate the ends of linear DNA molecules. Telomeric 
DNA is included as an optional component, to be used when linear chromosomes 
are desired. This is indicated herein by enclosing the terms "telomeric"/ 
"telomere" in parentheses. 

Prior to transfection, the DNA can be naked, condensed with one or more 
DNA-condensing agents, or coated with one or more DNA-binding proteins. 
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The invention is also directed to an artificial mammalian chromosome 
produced by the process of introdudng into a mammalian cell the isolated purified 
DNA jfragments above. In a preferred embodiment the process uses DNA 
essentially comprising centromeric, telomeric, and genomic DNA. 

The various fi-agments can be transfected separately or one or more can 
be ligated prior to transfection. Thus the centromeric (telomeric) and genomic 
DNAs are introduced sqparately (unligated) or one or more of the isolated purified 
DNAs are ligated to one another. 

The invention is also directed to a mammalian cell containing and 
compositions comprising the artificial mammalian chromosome. 

The invention is also directed to the isolated purified DNA described 
above, and which forms an artifidal mammalian chromosome when introduced 
into a mammalian cell. In preferred embodiments, the isolated purified DNA 
essentially comprises centromeric, telomeric, and genomic DNA. 

The invention is also directed to a mammalian cell containing and 
compositions comprising the purified DMA. 

The invailion is also directed to a vector or vectors contaimng the purified 

DNA. 

The invention is also directed to a mammalian cell containing and 
compositions comprising the vector(s). 

The invention is also directed to the isolated purified DNA described 
above produced by the process of combining one or more of the DNAs described 
above. In preferred embodiments, the DNA includes: (1) centromeric DNA, 
(2) telomeric DNA, (3) genomic DNA. The DNAs can be unligated or one or 
more can be ligated to one another. 

The invention is also directed to a method for making an artificial 
mammalian chromosome by introducing into a mammalian cell the purified DNA 
described above. 
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The invention is also directed to a method for maJdng DNA capable of 
forming an artificial chromosome, the method comprising combining in vitro the 
DNA described above. 

The invention is also directed to a method for propagating an artificial 
chromosome in mammalian cells by introducing the purified DNA into a 
mammalian cell and allowing the chromosome to replicate. 

Ixv a preferred embodiment, the invention is also directed to methods for 
expressing a heterologous gene in a mammalian cell by expressing that gene fi-om 
the artificial mammalian chromosome. 

Thus, the invention is also directed to methods for providing a desired 
gene product by including a deared gene on the artifidal chromosome such that 
the gene of int^est is expressed. In preferred embodiments, the invention 
pro>ddes a method of gene therapy by including heterologous therapeutic DNA on 
the artifidal mammalian chromosome, such that there is a therapeutic effect on the 
mammal contmning the chromosome. 

In a preferred embodimoit of the invention, the c«itromaic DNA is alpha- 
satellite DNA. 

In a preferred embodiment of the invention, the artificial mammalian 
chromosome is derived entirely fi-om human DNA sequences and is fimctional in 
human cells. 

Brief Description of the Figures 

Figure 1. Method for producing large head-to-tail tandem arrays of alpha 
satellite DNA. pVJ104-Yal6 was linearized with BamHI and Sfil, and purified 
by pulsed fidd gel electrophoresis (PFGE). Likewise, pBac-Yal6 was linearized 
with BamHI and Bglll and the alpha satellite array was purified by PFGE. A) 
The purified arrays were incubated together in the presence of ligase, BamHI and 
Bgin. Since BamHI and Bglll are complementary/nonisoschisomeric overhangs, 
a ligation event resulting in a J?a»ifflZfi!g/ff junction (as is the case in a head-to-tail 
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joining) will destroy both sites. Thus, a head-to-tail junction will be resistant to 
cleavage hy BamHI and Bglll. In contrast, a head-to-head, or tail-to-tail ligation 
event will recreate a BamHI or Bglll site, respectively. Since BamHI and Bglll 
are present, these ligation products will be cleaved to produce their constituent 
monomers (or head-to-tail multimers). By controlling the amount of ligase, the 
incubation time, and the concentration of DNA, the length of the head-to-tail 
products can be varied as necessary. B) Following ligation, the products were 
analyzed by PFGE. Lane 1, molecular weight standards (NEBL Midrange II 
markers); lane 2, Yal6 (BamHI/Bglll ^agment) ligated in the presence of BamHI 
and Bglll for 4 hours; lane 3, Yal6 (BamHI/Bglll fragment) ligated in the 
presence of BamHI/Bglll for 12 hours; lane 4, Yal6 (BamHI/BglH fragment) 
mock-ligated in the presence of BamHI and BgUI; lane 5, VK75 (BssHII 
fragment) ligated for 12 hours without restriction enzyme; lane 6, VK75 (BssHII 
fragment) ligated for 12 hours in the presence of BssHII; lane 7, VK75 (BssHII 
fragment) mock-ligated. The molecular weight of ligation products are shown on 
the left. Note: Although these samples were nm on the same gel, several 
irrelevant lanes between lanes 4 and 5 were removed. 

Figure 2, Strategy for making synthetic chromosomes. 

Figures. Anafysis of synthetic chromosomes from clones 22-7 and 22-13 
by fluorescent in situ hybridization (FISH). Cells were harvested, dropped onto 
glass slides, and hybridized to Y alpha satellite DNA as described in the 
Experimental Procedures (See Examples herein). The biotinylated probe was 
detected using Texas Red A\idin and amplified with two layers of biotinylated 
anti-Avidm and Texas Red Avidin. A) DAPI image of a metaphase spread from 
clone 22-7. B) Same as A) except that the alpha satellite probe was visualized 
using a triple cube filter. C) DAPI image of a metaphase spread from clone 
22-13. D) Same as C) except that the alpha satellite probe was visualized using 
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a triple cube filter. In each case, the synthetic chromosome is indicated wth a 
white arrow. 

Figure 4, Analysis of synthetic chromosomes from clones 22-6 and 23-1 
by FISH. Cells were harvested, dropped onto glass slides, and hybridized to Y 
alpha satellite DNA (clone 22-6) or 17 alpha satellite DNA (clone 23-1) as 
desoibed in the experimental procedures. The biotinylated probe was detected 
usang Texas Red A'wdin and amplified with two layers of biotinylated anti-Avidin 
and Texas Red Avidin. A) DAPI image of a metaphase spread from clone 22-6. 

B) Same as A) except that the alpha satellite probe was visualized using a triple 
cube filter. C) DAPI image of a metaphase spread from clone 23- 1 . D) Same as 

C) except that the alpha satellite probe was visualized using a triple cube filter. 
In each case, the synthetic chromosome is indicated with a white arrow. In D), 
the yellow arrow indicates the location of tiie C qter integration site. 

Figure 5. Analysis of synthetic chromosomes from clones 22-11 and 
17-15 by FISH. Cells were harvested, dropped onto glass slides, and hybridized 
to Y alpha satellite DNA (clone 22-1 1) or 17 alpha satellite DNA (clone 17-15) 
as desoibed in the experimental procedures. The biotinylated probe was detected 
usang Texas Red A\idin and amplified with two layers of biotinylated anti-Avidin 
and Texas Red Avidin. A) DAPI image of a metaphase spread from clone 22- U . 
B) Same as A) exc^t that the alpha satellite probe was visualized using a triple 
cube filter. C) DAPI image of a metaphase spread from clone 17-15. D) Same 
as C) except that the alpha satellite probe was visualized using a triple cube filter. 
In each case, the synthetic chromosome is indicated with a white arrow. InD), 
the yellow arrow indicates the location of the C qter integration site. 
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Figure 6. Determination of the amount of transfected alpha satellite DNA 
presait in clones containing the synthetic chromosome. A) Total genomic DNA 
was harvested, digested, and electrophoresed as described in the Experimental 
Procedures. Lane 1, HT1080; lane 2, clone 22-6; lane 3, clone 22-7; lane 4, 
clone 22-11; lane 5, clone 22-13; lane 6, clone 23-1. B) The estimated amount 
of synthetic Y alpha satellite DNA is shown for each clone. Note: clone 23-1 was 
transfected with 17 alpha satellite DNA, and therefore, does not contmn synthetic 
Y alpha satellite DNA. 

Figure 7. CENP-E is associated with the synthetic chromosomes during 
mitosis. LnmunofluorescCTice was carried out on metaphase chromosomes 
harvested from synthetic chromosome-containing clones as described in 
expaimental procedures. A) DAPI- stained chromosomes from clone 22-11. B) 
Same as A) except the location of the anti-CENP-E antibodies is visualized using 
a triple oibe filter, C) DAPI- stained chromosomes from clone 23-1. D) Same 
as C) fxc&pt tiie location of the anti-C]0«]P-E antibodies is visualized using a triple 
cube filter. In each case, the synthetic chromosome is in(Kcated by a white arrow. 

Figure 8. X-Gal plate staining of clone 22-1 1 after growth for 70 days in 
the absence of selection. Cells were harvested and stained as described in the 
Experimental Procedures herein. A) HT1080 B) Clone 22-11. The presence 
of blue cells in clone 22-11, but not in HT1080 indicates that P-geo is still 
expressed in these cells. 

Detailed Description of the Preferred Embodiments 

The inventors have discovered that fiinctional mammalian chromosomes 
can be constructed from purified DNA introduced into a mammalian cell. There 
are sevCTal advantages to using these chromosomes for a variety of applications. 
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First, since they are fonned and replicate autonomously, they will not 
result in insertional mutagenesis by inserting into the host genome. 

Second, because of the large size of the transfer vector On the megabase 
range), there is the capacity to accommodate the entire repertoire of a large gene 
includmg all of its regulatory elements. This itself may encompass megabases of 
DNA. 

TWrd, because some genetic diseases are the result of defects in more than 
one gene, because of the large size of the mammalian artificial chromosome more 
than one gene can be accommodated. 

Fourth, they are stable and can thus provide a therapeutic benefit over 
many cell divisions. 

Fifth, they are non-immunogenic. 

The invention is thus directed to a synthetic or artificial mammalian 
chromosome comprising essentially centromeric, genomic, and optionally, 
telomeric DNA. In an alternative embodiment, the artificial chromosome is a 
circular chromosome. In this case, telomeiic DNA is absent since it is not 
necessary to replicate chromosome ends. The chromosome has, at the minimum, 
DNA sequences that provide essential chromosomal fiinctions in a mammalian 
cell 

The genomic DNA is a subgenomic DNA fi-agment selected fi-om the 
group consisting of restriction enzyme digestion fi-agments mechanically sheared 
Augments, and fi-agments of DNA synthesized in vitro. The genomic DNA 
component of the chromosome can be derived fi-om a mbrture of subgenomic 
fi"agments (e.g., a restriction enzyme digest) or fi-om cloned fi-agment(s). 

The fimction of the genomic DNA is two-fold. The DNA expresses a gene 
product, or causes the expression of a gene product (as, for example, by having 
a regulatory fimction), and the DNA allows the formation of an artificial 
chromosome fi-om pimfied DNA in a cell without the integration of the purified 
DNA into an endog^ous chromosome in the cell, the artifidal chromosome also 
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containing centromeric DNA and, optionally, telomeric DNA. The genomic DNA 
can be derived from any organism and can be of any size. 

The genomic DNA that forms a component of the synthetic mammalian 
chromosome may be derived from a mammalian source other than the mammal 
from which the cdl is derived in which the chromosome replicates. For example, 
mouse genomic DNA can be pro\dded to human cells and human genomic DNA 
can be provided to the cells of other mammals. Further, it can be from a source 
different from the source of the centromere or telomere. 

Still frirther, the function of the genomic DNA exemplified herein can be 
potentially carried out by genomic DNA of any organism, including procaiyotic 
organisms, and by DNA synthesized in vitro and not corresponding to a naturally- 
occurring sequence, partly homologous to a naturally occurring sequence or 
completely non-homologous. 

Centromeric DNA essentially comprises a DNA that directs or supports 
kinetechore formation and thereby enables proper chromosome segregation. This 
centromeric DNA at active, functional centromeres is associated with CEMP-E 
during mitosis, as demonstrated by immunofluorescence or immunoelectron 
nucTOSCopy. By "assodated" is meant that the centrom^c DNA and CENP-E co- 
localize by FISH and immunofluorescence. 

In a preferred ©nbodiment of the inv«ition, the centromeric DNA is alpha 
satellite DNA. However, any functional centromeric DNA, and especially 
repetitive DNA, is «iabled by the methods described herein and usefiil for making 
artificial chromosomes. 

The inventors have created in vitro methods for producing large alpha 
satellite arrays. Previously, no method has been available allowing structurally 
intact alpha satellite DNA greater than 200 kb to be purified in the quantities 
necessary for the transfection of mammalian cells. By using these methods, 
controlled amounts of alpha satellite DNA can be produced in vitro. As described 
herein, by empirically controlling the amount of ligase, incubation time, and 
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concentration of DNA, the length of the ultimate product can be varied as 
necessary. 

However, the invention is not limited to centromeric DNA derived from 
alpha satellite DNA. The in vitro methods created by the inventors can be applied 
to any centromeric DNA that flinctions as described herein, and especially to 
repetitive DNA. 

Further, the entire alpha satellite repeat may not be required for 
centromere formation. Thus, the centromeric DNA can also comprise alpha 
satellite derivatives and analogs, for example, sub-monomer re^ons in alpha 
satellite or related satellite DNA. 

Subre^ons vn&an the alphoid monomer representing protein binding sites , 
can be ligated together to generate a functional centromere, consisting of a smaller 
r^jeat unit. The fiinctionality of this embodiment is shown by data from mouse- 
human hybrids. 

In the murine spedesM musculus, mmor satellite DNA contains CENP-B 
boxes and appears to be the ftmctional equivalent of alpha satellite DNA. 
fijterestii^y, inM musculus^ the minor satellite rq)eat imit is only 120 bp and has 
no apparent sequence homology to alpha satellite DNA outside of the CENP-B 
box. Despite the difference in repeat size and sequence, human chromosomes 
segregate eflBciently in mouse/human hybrids. This demonstrates that the 
centromeric repeat unit aze and sequence can vary without destroying centromere 
function. 

The murine species M caroU apparently lacks minor satellite DNA 
(Kipling et al, Mol Cell. Biol. 75:4009-4020 (1995)). In this species, the 
functional alpha satellite equivalent appears to be a 79 bp satellite sequence that 
contains a CENP-B box (there is also a 60 bp sequence that is 97% homologous 
to the 79 bp sequence but that lacks a CENP-B box). In crosses between M 
musculus and M carotid chromosomes from both species segregate normally 
vwtWn the same cell. This shows that both the mmor satellite and the 79 bp 
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satellite sequences are recognized by the same spindle during mitosis. Thus, 
different centromeric repeat sizes can be functional. 

Since alpha satellite, minor satellite, and 79 bp satellite repeats are different 
sizes and are functional, the absolute repeat size per se is not the determinant of 
functionality of centromeric DNA. Additionally, since there is only limited 
sequence homology betweai these centrom«ic repeats, it is likely that subregions 
within the repeats representing protem binding sites are the important fiinctional 
component. 

Thus, in one embodiment of this invention, the centromeric DNA contains 
subregions within alpha satellite DNA. In a preferred embodiment, the 
centromeric DNA is composed of tandemly ligated CENP-B boxes, defined by the 
sequaice 5'aTTCGttggAaaCG<5Ga3', where the bases indicated by capital/bold 
letters are the most important for CENP-B binding and the bases indicated by 
lower case letters may be substituted with other bases. 

In othOT «nbodiments, alphoid equivalents from other species are used for 
centromeric DNA. Human and other mammalian chromosomes have been shown 
to segregate effidently in cells from other spedes as demonstrated by interspedes 
somatic cell hybrids. Ejcamples of these hybrids include mouse x human, hamster 
X human, rat x human, hamster x mouse, rat x mouse, and chicken x human. The 
stability of a human chromosome in chicken cells (Dieken, E,, et aL, Nature 
Genet. 72:174-182 (1996)) shows that human ceitromeric DNA is also fiinctional 
in a non-mammalian species (i.e., a>^an). 

Based on observations from cross-species hybrids, it is clear that 
chromosomes from one species are functional in other species. Therefore, 
synthetic chromosomes can be produced in human cells using centromeric repeats 
from otiier mammals (and avians) instead o^ or in conjunction Awth, alpha satellite 
DNA. Cam&Lsehjr, alpha satellite DNA can be used as the source for centromeric 
DNA in other mammalian (and avian) species. 
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Thus, in a further embodiment of the invention, genomic (telomeric) DNA 
is transfected into cells along with M musculus minor satellite DNA, Mus caroli 
79 bp satellite DNA, or mialogous sequences from other mammals. In another 
embodiment, telomeric and genomic DNA is transfected into cells along with 
c«itromeric DNA from avian cells. * 

Essentially, centromeric DNA that is associated with CENP-E during 
mitosis is embodied in the aspect of the invention that encompasses the use of 
centromeric sequences heterologous to the host cell and other synthetic 
chromosomal components. As long as the centromeric sequence in the 
diromosome is assodated with C3ENP-E during mitosis, a functional chromosome 
for mammalian cells would be expected to result irrespective of the genomic 
sequence(s) and telomere sequences, and for that matter, irrespective of the 
^edfic centromeric sequence. 

The telomeric DNA can be derived from any DNA sequence (from any 
desired species) that retains a telomeric fimction. In mammals and other 
vertebrates, the most abundant and conserved sequence at the chromosome end 
is TTAGGG, which forms arrays between 2 and 20 kilobases in length. Human 
telomere DNA consists of about 5 kilobases of the tegeat TTAGGG, and small 
stretches of this sequoice are enough to seed telomere formation after 
introduction of linear molecules into mammalian cell lines (Huxley, C, Gene Ther. 
i:7-12 (1994)). Simple CrTAGGG)„ arrays are sufficient to provide the telomere 
fimction required by an artificial chromosome. The telomeric DNA, therefore, 
oonqmses tand«n arrays of the hexamer TTAGGG. Telomeric DNA is included 
when the formation of linear chromosomes is desired. 

Telomeres, centromeres and replication origins are discussed in Huxley, 
C. etcd., Biotechnol 72:586-590 (1994). 

Hie invention is also directed to purified DNA molecules that essentially 
comprise centromeric, genonuc, and optionally, telomeric DNA, as described 

In one embodiment, the purified DNA is naked DNA 
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Ih another embodiment, the purified naked DNA is condensed with one or 
more agents that condense DNA. It may be advantageous to condense the 
purified DNA prior to transfection in order to stabilize it against shearing. By 
condensing the purified centromeric (telomeric) and genomic DNA prior to 
transfection, it will become more resistant to structural insult arising fi-om 
manipulations during transfectioa Thus, in one embodiment of this invention, the 
purified c«itromeric (tdomeric) and genonuc DNA is condensed with one or more 
DNA condensing agents prior to transfection. In this respect, polycations have 
been shown to physically condense high molecular weight DNA and to protect it 
from mechanical shearing (Kovacic et aL, Nucleic Acids Res. 25:3999-4000 
(1995); Widom and Baldwin, /. Mol. Biol. 7^^:431-453 (1980); Widom and 
Baldwin, Biopolymers 22:1595-1620 (1983)). Therefore, in a fiirther 
embodiment, the purified DNA is condensed with polycationic compounds. 
Examples of polycationic compounds include poly-lysine, poly-arginine, 
spermidine, spermine, and hexaminecobalt chloride. 

In an alternative embodiment, the invention encompasses precoating DNA 
with proteins. It may also be advantageous to precoat the DNA with DNA- 
binding protdns such as histones, nonhistone chromosomal proteins, telom^e 
binding protdns, and/or c^tromere bincfing proteins. This precoating is expected 
to have sev«^ dearable consequences. First, it will result in condensation of the 
DNA which \wll protect the high molecular weight DNA fi-om shearing. Second, 
it -vwll inlubit nuclease degradation of the transfected DNA by blocking nucleases 
firora binding to the DNA. Third, the precoated DNA may entw the nuclease 
more efficiently following transfection, since each of the proteins listed above 
contain nuclear localization signals. By precoating the centromeric (telomeric) 
and genomic DNA with DNA binding proteins prior to transfection, we expect to 
inCTease the eflSciency of transfection and synthetic chromosome formation. 
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Thus, in another embodiment of this invention, the purified centromeric 
(telomeric) and genomic DNA is coated with DNA binding proteins prior to 
transfection. Examples of DNA-binding proteins include histones, non-histone 
chromosomal proteins, transcription factors, centromere binding proteins, and 
telomere binding proteins. 

DNA-binding proteins can also be identified and purified by thdr affinity 
for DNA. For example, DNA binding may be revealed in filter hybridization 
©cperiments in whidi the protein (usually labeled to fedlitate detection) is allowed 
to bind to DNA mobilized on a filter or, vice-versa in which the DNA binding site 
(usually labeled) is bound to a filter upon which the protein has been immobilized. 
The sequence specificity and affinity of such binding is revealed with DNA 
protection assajra and gel retardation assays. Purification of such protdns may be 
paformed utili2dng sequence-specific DNA affimty chromatography techniques, 
for example column chromatography with a resin derivatized with the DNA to 
which the domain binds. Proteolytic degradation of DNA-binding proteins may 
be used to reveal the domain which retains the DNA binding ability. 

The invention is thus directed to an artificial mammalian chromosome 
produced by the process of transfecting a mammalian cell with the purified DNA, 
described herdn, and allowing the cell to completdy reconstitute the DNA in vivo. 

The invention is thus directed to an artifidal mammalian chromosome 
produced by the process of transfecting a mammalian cell with purified naked 
DNA, the DNA comprising essentially centromeric DNA (telomeric DNA) and 
genomic DNA, as described hereia 

The invention is thus also directed to an artificial chromosome produced 
by the process of transfecting a mammalian cell with purified condensed DNA, the 
DNA comprising essentially, centromeric DNA (telomeric DNA), and genomic 
DNA, as described herein. 
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The invention is thus also directed to an artificial mammalian chromosome 
produced by the process of introducing purified coated DNA into a mammalian 
cell, the DNA comprising essentially a centromere (a telomere) and genomic 
DNA, as described herein. 

The invention is also directed to purified DNA made by the process of 
combining, in vitro, isolated purified and genomic DNA (telomeric DNA) as 
described herein. 

The invention is also directed to purified, condensed DNA made by the 
process of combimng, in vitro, isolated purified centromeric DNA (telomeric 
DNA) and gnomic DNA, as described herein. Alternatively, the individual DNA 
components could be pre-condensed and then combined. 

The invention is also directed to purified, coated DNA made by the 
process of combining, in vitro, isolated purified centromeric DNA (telomeric 
DNA) and g«iomic DNA, as desaibed herein and adding DNA-binding proteins. 
Alternatively, the individual DNA components could be pre-coated and then 
combined. 

The purified DNA described above may comprise imligated centromeric 
(telomeric) and genomic DNA. Alternatively, the purified DNA described above 
can also comprise centromeric (telomeric) and genomic DNA in which one or 
more of these DNAs are ligated to each other. 

The invention is also directed to a composition comprising the purified 
DNA described above. The composition may contain components that facilitate 
the entry of the DNA into a cell. For the formation of an artificial chromosome, 
the composition may facilitate the uptake of the DNA into a manunalian cell. 
Alternatively, the composition may comprise ingredients that fecilitate the uptake 
of the DNA into a cell which is used for propagation of a vector containing the 
DNA. 
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The invention is also directed to a vector containing the DNA described 
above. The vector may be used for propagating the DNA, i.e., amplifying the 
sequences described above prior to introducing them into a mammalian cell and 
forming an artificial chromosome. 

Accordingly, the invention is also directed to a composition comprising the 
vector containing the DNA described above. 

The invention is also directed to a cell containing the vector described 

above. 

The invention is also dkected to a mammalian cell contmning the artifidal 
chromosome. 

The invention is also directed to a mammalian cell containing the purified 
DNA described above. 

Although any mammalian cell is encompassed by the invention, in 
preferred embodiments of the invention, the mammalian cell is a human cell. 

In preferred embodiments of the invention, the centromeric DNA is human 
alpha satellite DNA. It is understood, however, that alpha satellite DNA may be 
derived fix)m any primate. The invention further encompasses centromeric DNA 
from non-piimate mammals, wherein said centromeric DNA is assodated with 
CENP-E during mitosis. Any centromeric DNA that is associated with CENP-E 
during mitosis, and especially repetitive DNA, irrespective of the organism from 
which it is derived, is expected to provide fiinctional centromeric sequences for 
an artifidal mammalian chromosome according to the present invention. Thus, an 
artifidal mammafian chromosome that functions in human cells, for example, may 
contain centromeric sequences derived not from humans but from non-human 
mammals and even from non-mammalian species such as avians. Any repetitive 
DNA that is assodated with CENP-E is potentially useful. Accordingly, following 
the methods taught herein, any centromeric sequence can be tested for fimction 
as a component of a mammalian artifidal chromosome. 
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In spedfic disclosed embodiments of the invention, the centromeric DNA 
comprises large stretches of alpha satellite array, a segment composed of the 
repeating telomeric sequence (TTAGGG)„, and random genomic fragments 
produced by digestion with the restriction enzyme Notl. In preferred 
embodiments, the restriction enzyme digests DNA into pieces in the range of 
fragments generated hy Notl digestion of human genomic DNA and preferably in 
the range of 10 kb to 3 mb. This includes but is not limited to BamHI, BgU, Sail, 
Xhol, Sfll, Notl, Srfl, Pmel, ondAscL 

When the purified DNA is introduced into a mammalian cell, this DNA 
forms a fimctional synthetic or artificial chromosome. This chromosome has the 
characteristics of a naturally-occurring mammalian chromosome. The 
chromosome is present in the cell at a low copy number, usually one per cell. The 
chromosome is linear and contains telomeric sequences. CENP-E is associated 
with the artificial chromosome during mitosis, indicating the formation of a 
fijnctional kinetechore. The chromosome is mitotically stable in the absence of 
sdection. The chromosome is structurally stable with time with an undetectable 
integration frequenqr. The chromosome contains one or more transcriptionally 
active genes. Thus, these chromosomes do not originate from naturally-occurring 
chromosomes but are constructed starting in vitro fix)m isolated purified DNA 
sequences. 

Accordingly, the invention is also directed to a method for making an 
artificial mammalian chromosome, the method comprising introducing into a 
mammalian cell the purified DNA described above. 

The DNA can be introduced into the mammalian cell by any number of 
methods known to those in the art. These include, but are not limited to, 
electroporation, calcium phosphate precipitation, lipofection, DEAE dextran, 
liposomes, receptor-mediated endocytosis, and particle delivery. The 
chromosomes or DNA can also be used to microinject eggs, embryos or ex vivo 
or in vitro cdls. Cells can be transfected with the chromosomes or with the DNA 
described herein usmg an appropriate introduction technique known to those in 
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the art, for example, liposomes. In a preferred embodiment of the invention, 
introduction of purified DNA into the mammalian cell is by means of lipofection. 

The purified DNA is thus useful for transfecting a mammalian cell, said 
transfecting resulting in the formation of an artificial chromosome in the cell fi-om 
the transfected DNA. 

The DNA can be propagated in non-mammalian cells separately or where 
one or more of the components is ligated together. Thus, the invention is also 
(Urected to the purified DNA ligated into a vector for propagation. Such vectors 
are well-known in the art and include, but are not limited to, pBac/108L, PI, 
pACYC184, pUC19, pBR322, YACs, and cosmids. 

The invention is also directed to a mammalian cell containing the artificial 
or synthetic chromosome. The invention is directed to any mammalian 
chromosome or mammalian cell. Although all mammals are encompassed, the 
preferred embodiment is the human. A preferred embodiment of the invention 
therefore encompasses a human cell containing a synthetic human chromosome. 

The DNA and chromosomes have been developed especially for use as 
expresdon vectors for gene therapy and other purposes. Therefore, in preferred 
embodiments of the invention, the purified DNA also consists essentially of one 
or more DNA sequences useful for the expression of a desired gene product, for 
example, as therapeutic agents. The invention is thus directed to a method for 
introdudng expresdble DNA into a cell by including this DNA on the artificial 
chromosome. The DNA can be regulatory, structural, pressed as a gene 
product, and the like. In a preferred embodiment, the DNA provides a gene 
product. When transfected into mammalian cells, the artificial chromosomes that 
are formed following transfection harbor and express these DNA sequences. 

Recombinant DNA technology has be«i used increasingly over the past 20 
years for the production of defied biological materials. DNA sequences encoding 
a variety of medically important human gene products have been cloned. These 
include insulin plasminogen activator, al anti-trypsin, and coagulation factors. 
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The present invention, however, encompasses the expression of any and all desired 
medically and/or biologically relevant gene products. 

Once in the cell, the heterologous gene product is expressed in the tissue 
of choice at levels to produce functional gene products. The general consensus 
is that correct tissue-specific expression of most transfected genes is achievable. 
For correct tissue specificity, it may be important to remove all vector sequences 
used in the cloning of the DNA sequence of interest prior to introduction into the 
cdl and formation of the artificial chromosome. Thus, the heterologous gene of 
interest can be incorporated into the artificial chromosome in a controlled manner 
so that the naturally-occurring sequaices are present in their naturally-occurring 
configuration, and tissue spedfidty is assured. 

Synthetic diromosomes can be mtroduced into human stem cells or bone 
marrow cells. Other applications will be clear to those of skill in the art. 

A variety of ways have been developed to introduce vectors into cells in 
culture and into cells in tissues of an animal or human patient. Methods for 
introducing vectors into mammalian and other animal cells include calcium 
phosphate transfection, the DEAE-dextran techmque, micro-injection, liposome- 
mediated tedimques, cationic Bpid-based techniques, transfection using polybrene, 
protoplast fusion techniques, electroporation, and others. These techniques are 
well known to those of skill in the art, and are described in many readily available 
publications and been ext«iavely reviewed. Some of the techmques are re\dewed 
in Transcription and Translation. A Practical Approach, Hames, B.D. & Hig^ns, 
S.J., eds., IRL Press, Oxford (1984), herein incorporated by reference for their 
relevant teachings, and Molecular Cloning, 2nd Edition, Maniatis et al. Cold 
Spring Harbor Laboratoty Press, Cold Spring Harbor, NY (1989), herein 
incorporated by reference for its relevant teaching. 

In the description, reference has been made to various methodologies 
known to those of sldll in the art of molecular biology. Publications and other 
matoials setting fiarth such known methodologies to which reference is made are 
incorporated herein by reference for their relevant teachings. 
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A standard reference work setting forth the general principles of 
recombinant DNA technology is Maniatis, T. et aL, Molecular Cloning: A 
Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY (1989). 

All terms pertaining to recombinant DNA technology are used in their art- 
recognized manner and would be evident to one of ordinary skill in the art. 

Hie t«ins "Y alpha satellite" and " Ya" are used interchangeably and refer 
to alpha satellite DNA derived from the Y chromosome. 

The terms "17 alpha satellite" and "17a" are used intwdiangeably and refer 
to alpha satellite DNA derived from chromosome 17. 

Alpha satellite DNA is a tandemly-repeated DNA sequence present at 
human centromeres and that comprises a basic monomeric repeat of approximately 
170 bps. This small repeat is organized into higher order units that have been 
shown to be specific to one or a small group of human chromosomes. 

The term "centromeric" means that region of the chromosome that is 
constricted and is the site of attachment of the spindle during meiosis or mitosis. 
It is necessary for the stability and proper segregation of chromosomes during 
meiosis and nutosis and is therefore an essential component of artifidal 
chromosomes. Centromeric DNA comprises a DNA that directs or supports 
kinetochore formation and thereby enables proper chromosome segregation. 
Centromeric DNA at active, fiinctional, centromeres is associated with CENP-E 
during mitosis, as demonstrated by immunofluorescence or immunoelectron 
microscopy. By "associated" is meant that the centromeric DNA and CENP-E co- 
localize by FISH and immunofluorescence. 



"Essential chromosome functions" are discussed in the description and 
background above. These include mitotic stability vwthout experimental selective 
pressure, substantially 1:1 segregation, autonomous replication, i.e., centromere, 
telomere, and origin of replication functions. 

The term "functional equivalent" denotes a genetic function that arises 
from a diflf^ent DNA or protein sequence, but \vbich provides the same biological 
function. 

The term "gene product" denotes a DNA, RNA, protein or peptide. 

The term "genomic DNA" encompasses one or more cloned fragments or 
fragments from a restriction digest or other mbrture of sequences and sizes, for 
example mechanically sheared DNA, or DNA synthesized in vitro. The DNA 
could be derived from the same chromosome (as, for example, when cloned 
fragments are used or when DNA from a purified chromosome is digested) or 
from different chromosomes (as, for example, when a genomic restriction digest 
is used for transfection). 

The term "genomic" refers to DNA naturally found in the genome of an 
organism. However, the invaitors also recognize that the function of this genomic 
DNA could be carried out by DNA from other sources, for example, synthetic 
DNA that has a sequence not found in nature. Thus, as used herein, "genomic" 
DNA is also used generically to refer to the DNA that is introduced into a cell 
along with the centroraeric (telomeric) DNA described herein, and vMch DNA 
represses a g«ie product, or causes expression of a gene product, and allows the 
formaticMi, in a cdl, of a chromosome from purified DNA without the integration 
of the purified DNA into an endogenous chromosrame in the cell. This DNA could 
thus be synthetic or derived from any organism and can be of any size as long as 
it contains the requisite expressible sequence and the function discussed above. 

Thwefore, in addition to the centromeric DNA, the artificial chromosome 
that is encompassed in the invention essentially contains DNA sequences that 
express a gene product, or causes expression of a gene product, and that allows 
the formation of a chromosome from purified DNA without the integration of the 
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purified DNA into an endogenous chromosome in the cell. The sequence that 
fiinctilons to provide the chromosomal expression (e.g., non-integration) and the 
expression sequence can be the same sequence. Thus, it is within the 
contemplation of the inventors that the expressible sequence also pro\ades the 
other functions. Alternatively, the sequence that provides the chromosomal 
fimction and the expresaon fimction may be different sequences and from different 
sources. 

In a specific disclosed embodiment, the genomic DNA is derived from a 
Not I restriction digest. Therefore, in a preferred embodiment, DNA that allows 
the formation of a chromosome from purified DNA without the integration of the 
purified DNA into an endogenous chromosome is derived from a restriction 
fragment generated by the digestion of total genonuc DNA with a restriction 
enzyme having the recognition cite (8 nucleotides) of Not I. However, it is well 
vwthin the contemplation of the inventors to use restriction fragments and other 
fragments of naturally-occurring genomic DNA that are smaller than those 
generated by Not I and comparable enzymes. For example, the inventors 
contemplate reducing the size of the DNA while retaining the fiinctions above. 
Thaefore, in a highly preferred embodiment, the DNA is pared down to contain 
only the DNA necessary to provide for the expression of one or more genes of 
interest and to provide the fimction of allovwng the formation of the chromosome 
from purified DNA without prior integration of the purified DNA into an 
endogenous chromosome. 

The source of these DNAs need not be the same. Thus, the expresaion 
sequence can be derived fix)ra one organism and the sequence that provides the 
chromosomal fimction can be from another organism. Further, one or both 
sequences can be synthesized in vitro and need not correspond to naturally 
occurring sequences. In this respect, the sequences need not strictly be 
"genomic". The only restriction on the sequences is that they provide the 
fiinctions indicated above. 
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The term "heterologous" denotes a DNA sequence not found in the 
naturally-occurring genome in which cell the artificial mammalian chromosome is 
introduced. Additionally, if the sequence is found, additional copies are 
considered "heterologous" because they are not found in that form in the 
naturally-occurring genome. As discussed above, the heterologous DNA can 
simultaneously be the desired expression sequence(s) and the "genomic DNA". 
"Expressible" DNA may not itself be expressed but may allow or cause the 
expression of another DNA sequence, heterologous or endogenous. This is the 
case if the DNA is regulatory, for example. 

The term "mammalian chromosome" means a DNA molecule or genetic 
unit that fimctions as a chromosome in a mammalian cell. 

The term "naked DNA" means DNA that is unassociated with any of the 
biological (chromosomal or cellular) components with which it is normally 
associated in a naturally-occurring chromosome, for example histones, 
non-histone chromosomal proteins, RNA, transcription factors, topoisomerases, 
scaffold proteins, centromwe-binding protdns, and telomere-binding proteins. 
Such DNA can be isolated fi-om cells and purified fi-om the non-DNA 
diromosomal components. Alternatively, this DNA can be synthesized in vitro. 

The torn "naturally-occuning" denotes evaits that occur in nature and are 
not experimentally-induced. 

An origin of replication indicates a site of initiation of DNA synthesis. 
The term "isolated" refers to DNA that has been removed fi-om a cell. The 
term "purified" refers to isolated DNA that has been substantially completely 
separated fi-om non-DNA components of a cell or to DNA that has been 
synthesized in vitro and separated substantially completely from the materials used 
for synthesis that would interfere with the construction of the chromosome fi-om 
the DNA. A purified DNA can also be a DNA sequence isolated fi-om the DNA 
sequences with which it is naturally associated. 

A replicon is a segment of a genome in which DNA is replicated and by 
definition contains an origin of replication. 
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The phrase "retains all the fiinctions of a natural mammalian chromosome" 
means that the chromosome is stably maintained in dividing mammalian cells as 
anon-integrated construct, without experimental selective pressure, indicating at 
least centromeric, telomeric (for linear chromosomes), origin of replication 
functions, and gene expression. 

The term "stable" denotes that the synthetic or artificial chromosome 
remains present in at least 50% of the cells after ten generations in the absence of 
experimental selective pressure (such as drug selection and the like), and most 
preferably, that after 30 generations, it is present in at least 10% of the cells; and 
preferably, the synthetic chromosome exhibits 1:1 segregation greater than 99% 
of the time. 

The terms "synthetic" or "artificial" are used interchangeably. A 
"synthetic" or "artificial chromosome" is a construct that has essential 
chromosome functions but which is not naturally-occurring. It has been created 
by introducmg purified DNA into a cell. Since the chromosome is composed 
entirely of transfected DNA, it is referred to as synthetic or artificial. An artificial 
or synthetic chromosome is found in a configuration that is not naturally- 
occurring. 

The term "transfecting" denotes the introduction of nucleic acids into a 
cell. The nucldc add thus introduced is not naturally in the cell in the sequence 
introduced, the physical configuration, or the copy number. 

A telomere denotes the end of a chromosome comprising simple repeat 
DNA that is synthesized by a ribonucleoprotein enzyme called telomerase. The 
function is to allow the ends of a linear DNA molecule to be replicated. 

A nucldc acid molecule such as a DNA or gene expresses a polypeptide 
or gene product if the molecule contmns the sequences that code for the 
polypeptide and the expression control sequences which, in the appropriate host 
environment, provide the ability to transcribe, process and translate the genetic 
information contained in the DNA in a protein product and if such expression 
control sequences are operably linked to the nudeotide sequence which encodes 
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the polypeptide. However, as discussed herein, a gene product need not be 
restricted to a polypeptide gene product but may encompass RNA. Further, 
genetic defects that are capable of being corrected by the artificial mammalian 
chromosomes when used as expression vectors may be defects that operate in cis 
to effect further gene expression. 

An operable linkage is a linkage in which the regulatory DNA sequences 
and the DNA sequence sought to be expressed are connected in such a way as to 
permit gene expression. The precise nature of the regulatory regions needed for 
gene expression may vary from organism to organism but in general include a 
promoter region, 5' non-coding sequences involved with initiation of transcription 
and translation such as the TATA Box, CAP Sequence, CAAT Sequence, and the 
like. If desh-ed, the non-coding region 3 ' to the gene sequence coding for the 
protein may be obtained by the above-desaibed methods. This region may be 
retained for its transcriptional termination regulatory sequences such as 
termination and polyadenylation. Thus, by retaining the 3' re^on naturally 
contiguous to the DNA sequence coding for the protein, the transcription 
termination signals may be provided. Where the transcaiptional termination signals 
are not satisfectorily functional in the expression host cell, then a 3' region 
fimctional in the host cell may be substituted. 

The following examples do not limit the invention to the particular 
embodiments described, but are presented to particularly describe certain ways in 
which the invention may be practiced. 
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Examples 

Experimental Procedures 

Description ofDNA constructs 

Standard molecular biology techniques wweused to construct all plasmids 
described here (Sambrook, J. et al, eds., "Molecular Cloning", Cold Spring 
Harbor Laboratory Press (1989)). Cloning of the alpha satellite higher order 
repeat from the Y chromosome and chromosome 17 has been described previously 
(Wolfe, J. etal, J. Mol Biol 182:477-435 (1985); Van Bokkelen, G.B. etal., 
"Method for Stably Cloning Large Repeating Units of DNA", U.S. Patent 
Application(1995)); Waye & Willard, Mol. Cell Biol (J:3 156-65 (1986)). By 
directional cloning through the creation of the appropriate restriction sites, 
successively larger alpha satellite arrays have been created in the plasmid 
pBAC108L Van Bokkelen, G.B. et al, "Method for Stably Cloning Large 
Repeating Units of DNA", U.S. Patent AppUcation No. 08/487,989, filed June 7, 
1995, yfAaxh is incorporated herdn by reference for teaching the cloning of large 
tandem arrays of repetitive sequences. 

Plasmids used in the experiment 

pBAC108L has been described previously (Shizuya, H. etal, Proc. Natl 
Acad Sci. USA 5P:8794-7 (1992)). pVJIOS is a modified version of pBAClOSL 
that contains additional restriction sites in the polylinker and a p-geo expression 
unit consisting of the CMV immediate early gene promoter and SV40 
polyadenylation signal (Seed, B., Nature 52P:840-2 (1987); Seed & Aruffo, Proc. 
Natl Acad. Sci. USA 84:3365-9 (1987)), the p-geo open reading fi^me 
(MacGregor, G.R. et al. Development 727:1487-96 (1995)), and the UMS 
transcriptional termination sequence (Heard, J.M. et al, Mol Cell Biol 
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7:2425-34 (1987); McGeady, MX. et al., DNA 5:289-98 (1986); Salier & 
K\xaicU, Biotechniques 7:30-1 (1989)). pBACYal6 (92 kb of Y alpha satellite) 
consists of 16 identical higher order repeats cloned head-to-tail into pBAClOSL. 
pBAC17a32 (87 kb of 17 alpha satellite) consists of 32 identical higher order 
repeats cloned head-to-tail into pBAClOSL. pVJ105-Ya 16 was made by cloning 
the alpha satellite array from pBAClOSL into pVJlOS. Following linearization 
with BamHI and Sfil, the direction of P-geo expression is toward the alpha 
satellite array. pVJ105-17a32 was made by cloning the alpha satellite array from 
pBAC17a32 into pVJ105. Following linearization as above, the direction of 
p-geo transcription is toward the alpha satellite array. All plasmids were purified 
by alkaUne lysis (Sambrook, J. et al, eds., "Molecular Cloning", Cold Spring 
Hari)or Laboratory Press (1989)) followed by agarose gel purification. 



Creadon of alpha satellite arrays >100 kb by muldmerization 



To create Y alpha satellite arrays, pVJlOS Yal6 was digested with BamHI 
and Sfil and gel purified by pulsed field gel electrophoresis (PFGE). Additional 
alpha satdOte DNA was prepared by digesting pB ACYa 1 6 with BamHI and Bglll 
and gel purifying the 92 kb alpha satelihe fi-agment by PFGE as above. Following 
band isolation, the agarose bands were equiDbrated in 10 mM Tris pH 7.5, lOOmM 
NaCI, 10 mM MgClj and then melted at 65° for 5 minutes. The two fragments 
were then combined at a molar ratio of 5:1 (pBACYal6 alpha satellite fragment: 
pVJ105-Yal6 fiiagment). ATP (1 mM final) and T4 Ligase (5 units) were added 
and the reaction was incubated at 37 °C for 4 hours in the presence of BamHI (40 
units) and BgUI (40 units), p agarase (3 units) was added and the reaction was 
incubated at 37°C for 1 hour. The reaction was then placed on ice for 1 hour 
prior to transfection into HT1080 cells. To create extended alpha satellite arrays 
for 17 alpha satellite DNA, the above procedure was used witii pVJ105-17a32 
and pBAC17a32 in place of pVJ105-Ycxl6 and pBAC17a32, respectively. 
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Pteparation of high molecular weight human genomic DNA 

Four 150 mm plates containing HT1080 cells were grown to confluent^, 
removed from the plates with trypsin/EDTA, and washed with 100 ml PBS. High 
molecular weight DNA was harvested in low gelling temperature agarose plugs 
(Sambrool^ J. etaL, eds., "Molecular Cloning", Cold Spring Harbor Laboratory 
Press (1989)). Approximately 1 A/g of human genomic DNA was digested with 
Notl. Following digestion, NotI was inactivated by heating the reaction to 70' C 
for 5 min. Prior to transfection, the agarose plug was digested with 3 units of 
p-agarase. 

Pr^aradon of tdomeric DNA 

Human telomeric DNA was generated by PGR using primers 42a 
(5'GGGTTAGGGTTAGGGTTAGGGTTAGGGTTAGGGTTAGGG3') and 42b 
(5*CCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCC3') <Ijdo, J.W. 
etoL^ Nucleic Adds Res. /P:4780 (1991)). Each PGR reaction contained 250 ng 
of 42a and 42b, 5 Units Taq polymerase, 250 fJM. dNTPs, 3.3 mM MgCIj in IX 
PGR BuflFer (Gibco BRL). The PGR reaction was carried out for 35 cydes in a 
Potkin Elmo* 9600 Thermal cycler using the following temperature profile: 95°G 
for 20 seconds, 40°G for 20 seconds, 12°C for 2 minutes. Following PGR, each 
reaction was subjected to agarose gel electrophoresis to purify telomeric DNA 
tiiat is greater than 1 kb in size. This DNA was excised from the gd and purified 
away from the agarose using Magic Prep columns according to the manufecturer's 
instructions (Promega, WI). 
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Transfection of human cells 

Prior to transfection, pVJlOS Yal6 and pVJlOS 17a32 were digested with 
BamHI and Sfil; pBac Yal6 and pBac Ya32 were digested with BamHI and 
Bgm, The DNA was then purified by PFGE, equilibrated against 10 mM Tris pH 
7.5, 100 mM NaCl, and combined with telomeric DNA and/or NotI digested 
human genomic DNA. In some cases, the alpha satellite arrays were extended 
using the du-ectional ligation approach described in Figure 1. 

Once the DNA components for each transfection were combined and 
g«itly mixed. Transfections contained either pVJ105 Yal6 (0,5-1 jig), pVJ105 
17a32 (0.5-1 ^g), or pVJ105 VK75 (0.5-1 ng). Where indicated, the 
transfections also contained purified Ydl6 arrays (0.5-l(ig), 17a32 arrays 
(0.5-1 ug) telomeric DNA (75-250 ng), human genomic DNA (1-3 and/or 
VK75 fi-agment (0.5-2 |ig). 1 ml serum fi-ee a-MBM media (MediaTech) was 
added. 7.5 ^A lipofectin was then added, and the solution was incubated at room 
temperature for 5 minutes. The DNArlipofectm nuxture was added to 2 x 10** 
HT1080 cells, according to the manufecturer"s instructions (Gibco BRL). After 
a 16 hour incubation at 37''C, the DNArlipofectin solution was removed and 
complete media was added to the cells. At 36 hours post transfection, the cells 
were removed fi-om the wells with trypsin/EDTA and transferred to a 100 mm 
plate containing complete media supplemented with 300 /^g/ml G418. On the 
seventh day of selection, the media was replaced with fi-esh complete media 
supplemented with 300 //g/ml G418. After 12 days of selection, individual 
colonies were isolated using sterile cloning rings and placed into 24 well plates. 
The indhidual clones fi-om each transfection were then expanded under selection 
into 100 mm plates. A portion of each culture was fi-ozen for future analysis, 
while the bulk was harvested for analysis by FISH. 



-40- 



Cell culture 

HTIOSO cells were grown in Alpha MEM media (Gibco/BRL, Bethesda, 
MD) supplemented with 15% fetal bovine serum (Hyclone), 
penidllin/streptomycin, and glutamine. The subclone of HTIOSO used in these 
experiments was tetraploid. 

Plate staining 

Cells containing the synthetic chromosomes were plated in 6 well plates 
at 10% confluenQT. Untransfected HTIOSO cells were similarly plated and used 
as a negative control. When the cells reached 70 % confluency, the media was 
removed and the cells were washed with 2 ml PBS. After removing the PBS, 1 
ml of fix solution (2% formaldehyde, 0.2% glutaraldehyde in PBS) was added to 
each well and the plate was incubated at room temperature for 4 minutes. The fix 
solution was removed and the cells were immediately washed with 2 ml PBS. 
Finally, PBS wash was removed and 1 ml of staining solution (5mM potassium 
ferricyanide, 5mM potassium ferrocyanide, 5 mM MgClj, and 1 /^gffA X-Gal in 
PBS) was added to each well and the plate was incubated for 12 hours at ST'C. 
The cells were washed vwth PBS and imaged using a light microscope and 
assodated ima^g hardware (Oncor, Gaithersburg, MD). 

Fluorescence in situ hybridization 

HTIOSO cells were grown on 100 mm tissue culture plates, harvested for 
FISH, and mounted onto slides according to published procedures (Verma, R. & 
Babu, A., Human Chromosomes, Principles and Techniques, 2nd Edition, 
McGraw-Hjll, Inc. (1995)). To detect alpha satellite sequences on the synthetic 
chromosomes and on the endogenous chromosomes, chromosome specific alpha 
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satellite probes were used accordbg to manufacturers instructions (Oncor, 
Gaithersburg, MD). 

Determination of synthetic Y alpha satellite DNA content in clones 
containing synthetic chromosomes 

Genomic DNA was harvested from HT1080 cells and from clones 22-6, 
22-7, 22-11, 22-13, and 23-1 according to published procedures (Sambrook, J. 
et al., eds.. Molecular Cloning, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY (1989)). Approximately 10 fig DNA from each clone was 
then digested with EcoRI (50 units) and PstI (50 units) overnight at 37°C. The 
samples were then electrophoresed through a 0.8% agarose gel, transferred to 
Nytran membrane, and hybridized to a 1 kb Y alpha satellite probe in 25% 
fonnamide/10% dextran/0.5% SDS/0.5M NaCl/200 /^g/ml salmon sperm DNA 
overnight at 65°C. EcoRI and PstI both cleave once in the endogenous Y alpha 
satellite higher order repeat to give a 4 kb and a 1.7 kb band. However, due to 
the method used to create the synthetic^Y alpha satellite arrays, EcoRI does not 
cleave the synthetic higher order repeat. As a result, EcoRI and Psfl digestion of 
the synthetic array results in a 5.7 kb band. Since we know that the endogenous 
Y alpha satellite array is 1 mb in length, we can determine the amount of synthetic 
alpha satdfite DNA in the cells contdmng synthetic chromosomes by determining 
the ratio of the 5.7 kb band (synthetic array) to the 4.0 kb band (endogenous 
array). It is not necessary to consider the 1.7 kb band since it does not hybridize 
with the probe under these hybridization conditions. It is important, however, to 
consider that most of these dones contain 2 Y chromosomes (because they are 
tetraploid) and only a sin^e synthetic chromosome. One exception to this is clone 
22-13 which contains a single Y chromosome and appears to be diploid. Thus, for 
clones 22-6, 22-7, and 22-1 1, the amount of synthetic alpha satellite DNA per cell 
is estimated by the following equation: 
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Intemity of 5.7 kh hand x Imb X 2 
Intensity of 4 kb hand 

For clone 22-13, the following equation was used: 

Intensity of 5.7 kb hand ^ , , 

— A. 1 mo 

Intensity of 4 kb band 

Immunofluorescence 

Anti-CENP immunofluorescence was carried out according to published 
procedures (SulUvan & Schwartz, Hum. Mol. Genet. ^:2189-2197 (1995)). 
Briefly, HT1080 cells were grown in tissue culture plates until approximately 80% 
conflurawy. Colcanid was then added to a final concentration of 40 ng/ml and the 
cells were incubated at 37°C for 75 minutes. The media was carefully removed 
and the cells were released fi-om the plate by incubation with tiypsin/EDTA for 3 
to 5 minutes. To neutrelize the trypsin, complete media was added to the cells and 
the resulting cell suspension was counted using a hemocytometer and spun at 
1000 rpm in a Jouan CT422 centrifuge. The supematent was discarded and the 
cells were resuspended at 0.6 xlO^ cells/ml by slowly adding hypotonic solution 
(25 mMKCl, 0.27% sodium citrate). Cells were incubated in hypotonic solution 
for 12 minutes at room temperature. 500 tA of cells were then added to a 
cytofimnel and spun at 1900 rpm for 10 minutes in a Shandon Cytospin 3 
centrifoge. The slides were then incubated in 10 mM Tris pH 7.7, 120 mM KCl, 
20 mMNaCl, 0.1% Triton X-100 for 12 minutes. Diluted antibody (50 (A, 1/1000 
in 1 mM triethanolamine, 25 mM NaCl, 0,2 mM EDTA, 0.5% Triton X-100, 
0.1% BSA) was added to each slide and a plastic cover slip was positioned over 
the cdls. Following a 30 minute incubation at 37°C, the coverslip was removed, 
and the slides were washed 3x2 minutes in a Coplin jar containing KB (10 mM 
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Tris pH 7.7, 150 mM NaCl, 0. 1% BSA). FITC-labeled anti-rabbit Ig was added 
(50 yul of 1/100 in KB) and a plastic cover slip was placed over the cells. 
Following a 30 minute incubation at 37 "C, the slides were washed 3x2 minutes 
in a Coplin jar containing KB. Before viewng, the slides were counterstained 
with 10 (A DAPI (2 yug/ml in antifede). Images were collected using a fluorescent 
microscope and imaging system (Oncor, Gaithersburg, MD). 

Mitotic stability time course 

Following cloning, cells were expanded into two 100 mm plates and grown 
in the presence of 300 /zg/ml G418. At 80% confluency, one plate for each clone 
was harvested for FISH analysis using the protocol described above. These cells 
serve as the time point of the time course. The other plate was split 1/16 into 
a 100 mm plate and grown to confluency in complete media lacking G418. As 
soon as the culture reached confluency, the cells were split 1/16 and grown in 
complete media lacking G418. This process was repeated for the period of time 
indicated in Table 2. At various time points, a portion of the culture was 
harvested for FISH and analyzed for the presence of the transfected alpha satellite 
(17a or Ya). For each intact chromosome spread, the number of Y alpha satellite 
(or 17 alpha satellite) signals and thek chromosomal positions were determined. 

Results 

The mammalian centromere is a complex chromosomal element thought 
to consist of large blocks of repetitive DNA, called alpha satellite. One of the 
major impediments inhibiting the elucidation of mammalian centromere structure 
and preventing the development of artificial human chromosomes has been the 
inability to done large segments of this class of DNA. Recentiy, methods for the 
doning and laige scale production of alpha satellite DNA up to approximately 175 
kb in length have been developed (Van Bokkelen, G.B. et aL, entitled "Method 



for Stably Cloning Large Repeating Units of DNA" U.S. Appl. No. 08/487,989, 
filed June 7, 1995. Equally important, the use of a directional cloning strategy 
allows the creation of alpha satellite arrays of known composition and structure. 

In order to facilitate the formation of a functional centromere fi-om naked 
transfected alpha satellite DNA, the inventors hypothesized that it could be 
advantageous to transfect alpha satellite DNA which is greater than 175 kb in size. 
Pre\iously, the largest contiguous alpha satellite array to be transfected into 
mammalian cells was 120 kb (Larin, Z, etal. Him. Mol Genet 5:689-95 (1994)). 
To produce alpha satellite DNA much larger than 175 kb, the directional ligation 
strategy shown in Figure lA was used. This in vitro technique allows the 
production of contiguous, uninterrupted Y alpha satellite arrays up to 736 kb in 
length (Figure IB, lanes 2-4). As a control, VK75 (a 75 kb Bss HII fragment) was 
ligated in the presence and absence of BssHII (Figure IB, lanes 5-7). Since 
BssHII ends regenerate a BssHII site when ligated, the ladder of multimers is 
digested down to constituent monomers when BssHII is included in the ligation 
reaction. Similar results were obtained in experiments carried out using BamHI 
fragments or Bglll fragments (data not shown). This demonstrates that the 
recleavage reaction is efiBcient and that the ladder in lanes 2 and 3 are the result 
of head-to-tail ligations. Finally, to test for biological differences between 
sqjarate fenulies of alpha satellite DNA, ©ctended arrays were also built consisting 
of alpha satellite DNA derived from chromosome 17 (data not shown). 

The inventors have utilized these large purified alpha satellite arrays to 
produce synthetic chromosomes using the strategy outlined in Figure 2 and 
described in Experimental Procedures. By cotransfecting each of these 
chromosome components, the inventors reasoned that the cell would combine 
these elements to form a fiinctional chromosome. Accordingly, HT1080 cells 
were transfected with various combinations of alpha satellite DNA, telomeric 
DNA, and human genonuc DNA. Following transfection, the cultures were 
placed under G418 selection for 10-14 days. Individual colonies were then 
isolated, expanded under selection, and harvested for FISH analysis. 
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Characterization of stable transfectants 

As shown in Table 1, in clones from the majority of transfections, alpha 
satellite DNA had integrated into an endogenous chromosome. In many 
transfections that included telomeric DNA, a high incidence of alpha satellite 
integration events associated with chromosome truncations was observed also. 
It has been observed previously that telomeric DNA can be used to efficiently 
tnmcate human chromosomes foUovwng integration (Bamett, M.A. et ah. Nucleic 
Acids Res. 27:27-36 (1993); Brown, K.E. et al. Hum. Mol. Genet. 5:1227-37 
(1994); Farr, C.J. et al, EMBO J. 7^:5444-54 (1995)). Here, telomeric DNA 
apparently integrated mto the endogenous chromosome along with alpha satellite 
DNA and caused a truncation event. 

In cells from a subset of transfections, however, synthetic chromosomes 
that contained the transfected alpha satellite DNA were observed (Table 2, 
transfections 22 and 23 and Figures 3-5). These positive transfections differed 
from the other transfections in two ways. First, prior to transfection, the alpha 
satellite DNA vras prdigated in vitro in the presence of BcanHI and Bglll (Figure 
lA and IB). This resulted in the generation of large, directional alpha satellite 
arrays ranging in saze from 100 kb to 736 kb in length. Second, NotI digested 
human genonuc DNA was included in the transfection. By including these two 
components, the essential DNA sequences necessary for synthetic chromosome 
formation wwe provided. 

FISH analysis of clones from transfections 22 and 23 revealed that 
approximately 50% of the G418 reastant clones contained synthetic chromosomes 
(Table 1). In four of the five synthetic chromosome containing clones from these 
two transfections, the transfected alpha satellite DNA was detectable only on the 
synthetic chromosome (Figures 2 - 4). That is, in the case of transfected Y alpha 
satellite DNA, only the synthetic chromosome and the Y chromosome had 
detectable signals by FISH Likewise, in the case of transfected 17 alpha satellite 
DNA, only the synthetic chromosome and chromosome 17 had detectable signals 
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for 17a by FISH. Interestingly, synthetic chromosomes formed in both 
transfections 22 and 23. This demonstrates that alpha satellites from the Y 
chromosome and from chromosome 17 are both capable of facilitating synthetic 
chromosome formation. As further evidence that alpha satellite DNA is an 
important component of the synthetic chromosomes, the alpha satellite FISH 
signal encompasses most or all of each synthetic chromosome. 

In cells that contain a synthetic chromosome, there were only two 
exceptions who-e alpha satellite DNA (derived from the same chromosome as the 
synthetic a satellite DNA used in the transfection) was detected on a chromosome 
other than the synthetic chromosome and Y chromosome (or chromosome 17 in 
cases where 17 a satellite was transfected). First, in clone 17-15, 17 alpha 
satellite DNA was detected on the synthetic chromosome, chromosome 17, and 
at the end of a C group chromosome (Figures 5A and B). Interestingly, this 
transfection contained unligated alpha satellite and telomeric DNA, but no human 
genomic DNA. One possibility is that the transfected DNA integrated into the 
endogenous diromosome, amplified, and broke back out. It is important to note 
that if non-alphoid, non-telomeric human sequences are necessary for 
diFomosome frmction, then this mechanism of synthetic chromosome formation 
might be necessary to provide additional DNA el«nents in the absence of human 
genomic DNA in the transfection. In the one case in which a synthetic 
chromosome formed in the absence of cotransfected human genomic DNA, alpha 
satellite was also found integrated into an endogenous chromosome. This shows 
that genomic DNA is nec^sary for some aspect of synthetic chromosome 
formation or maintenance. Second, in clone 22-1 1, both a synthetic chromosome 
and a Y:14 chromosome translocation were observed (Figures 5C and D). By 
pulsed field gel electrophoresis, the inventors have demonstrated that the Y;14 
translocation contains endogenous Y alpha satellite DNA, and not synthetic Y 
alpha satellite DNA. Thus, the synthetic alpha satelUte DNA is only detectable on 
the microchromosome, and not on any endogenous chromosome. 
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Estimadon of synthetic chromosome size 

The amount of alpha satellite DNA present in the synthetic chromosome 
containing cells ranges from about 350 kb to 2 mb (Figure 6). This was 
determined by taldng advantage of restriction site polymorphisms between the 
synthetic and endogenous alpha satellite arrays. By comparing the intensity of the 
synthetic alpha satellite band to the endogenous alpha satellite band on a Southern 
blot, the ratio of synthetic alpha satellite DNA to endogenous alpha satellite DNA 
can be determined. Since the endogenous alpha satellite array is 1 mb in length 
(Larine/a/., Hum. Mol Genet. 3:689-95 (1994)), and since the copy number of 
the Y chromosome (2 for clones 22-6, 22-7, and 22-1 1 and one for clone 22-13) 
and the copy number of the synthetic chromosome are known (Table 2), the 
amount, in kilobases, of synthetic alpha satellite DNA (Figure 5B) can be 
estimated. 

Although it is difficult to estimate the overall size of these synthetic 
chromosomes, in some cases, the synthetic chromosome is barely detectable using 
a fluorescence microscope at lOOOx magnification. 

Synthetic chromosome structure and copy number 

Upon initial analysis, each clone of synthetic chromosome contmning cells 
possessed way few synthetic chromosomes, and in most cases, only one per cell. 
This shows that the copy number of the synthetic chromosomes is regulated like 
that of the endogenous chromosomes. 

In addition to copy number, the synthetic chromosomes share two other 
features with the endogenous chromosomes. First, they contain telomaic 
sequences (data not shoAvn). This suggests that these synthetic chromosomes are 
linear. Second, in metaphase chromosomes, the individual chromatids are clearly 
viable on each synthetic chromosome (Figures 4-5). This shows that the overall 
structure of the synthetic chromosome is similar to that of the endogenous 
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chromosomes. Furthermore, since chromatids are normally held together at the 
centromere this result also shows that the synthetic chromosomes are capable of 
carrying out at least one centromeric function, the attachment of sister 
chromatids. 

CENP-E associates with synthetic chromosomes during metaphase 

The presence of synthetic chromosomes (in most cases at single copy) in 
dividing cells shows the creation of a functional centromere. In order to fiirther 
investigate this, several of the synthetic chromosomes were tested to determine 
whether CHn[P-E was present at the centromere during metaphase. It has been 
shown previously that CENP-B is present at both functional and nonfimctional 
centromeres (Eamshaw, W.C. et al., Chromosoma P5:l-12 (1989)), and 
therefore, it can not be used as a marker for centromere activity. For this reason, 
CREST antisera (used in previous experiments: Haaf et al, Larin et al, and 
Praznovsky et al. (cited above)), which generally recognizes CENP-B very 
strongly, is not a good reagent for assessing centromere activity. On the other 
hand, CENP-E has be^ shown to be present only at functional centromeres 
(Sullivan & Schwartz, Hvm. Mol Genet. ¥:2189-2197 (1995)), and therefore, 
monospedfic antibodies to this protein can be used to assess centromere activity. 

Consistent with the presence of a functional centromere, it was found that 
CENP-E was present on the ^thetic chromosome in clones 22-1 1 and 23-1, the 
only clones t^ed to date figure 7). Furthamore, the amount of CENP-E on the 
synthetic chromosome is similar to that present at the centromere of each of the 
«idogenous chromosomes. This is interesting because CENP-E is not thought to 
bind to centromeric DNA directly, and therefore, its level does not depend on the 
amount of alpha satellite present. Instead, it depends solely on whether a 
fimctional Idnetochore has formed. Thus, the presence of CENP-E on the 
synthetic chromosome during metaphase strongly suggests that they contain a 
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fiinctional centromere enable of directing formation of a centromere/kinetochore 
complex. 

Synthetic chromosomes are mitoticaify stable in the absence of 
selection 

To confirm that the synthetic chromosomes contain a functional 
centromere and are capable of correctly segregating in dividing cells, the synthetic 
chromosome containing cells were grown for a defined period of time in the 
absence of selection. The ceUs were then analyzed by FISH to determine the 
percentage of cells that contained the synthetic chromosome. After 46 days 
(approximately 60 cell generations) in the absence of selection, the synthetic 
chromosomes were still present in the majority of cells (Table 2). Li several 
clones, the synthetic chromosome was still present in 100% of the cells. This 
indicates that the synthetic chromosomes are mitotically stable, and therefore, 
validates the idea that these vectors can be used to transfect dividing cells to 
correct genetic defects in vivo. 

In addition to determining the segregation efficiency of each synthetic 
chromosome, this experiment also allowed us to assess the structural stability of 
the qrathetic diromosomes ova- time. After scanning 50 chromosome spreads for 
each clone, no cases in which the synthetic chromosome integrated into an 
endogenous chromosome were observed. Furthermore, no other gross 
rearrangements mvolving the synthetic chromosomes were observed. This result, 
in conjunction with their high degree of mitotic stability, demonstrates that these 
synthetic chromosomes behave as separate genetic units with many of the same 
characteristics as endogenous human chromosomes. 
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Gene agression from the synthetic chromosomes 

The synthetic chromosomes described here provide an alternative vector 
for somatic gene therapy. It is, therefore, important to determine whether 
heterologous genes can be efficiently expressed from these chromosome vectors. 

As described in the experimental procedures, the synthetic chromosomes 
were created by co-transfecting pVJ104-Yal6 or pVJ104-17a32 with telomeric 
DNA and human genomic DNA into HT1080 cells. In each transfection, the 
p-geo expression unit was linked to at least 100 kb of alpha sateUite DNA. 
Following transfection, the location of alpha satellite in the cell is the same as the 
location of the p-geo gene. Thus, in the synthetic chromosome clones, with the 
exception of clone 17-15, the P-geo gene is located exclusively on the synthetic 
chromosome. 

To determine the levels of P-geo expression in each of the synthetic 
chromosome containing clones, and therefore the extent of gene expression from 
the synthetic chromosome, the cells were assayed using the X-gal plate staining 
method described in the experimental procedures. Although this technique is 
relatively insensitive (i.e. p-geo expression must be high in order to be detected), 
it provided a rough approximation of expression levels and the percentage of cells 
expressing this marker gene. After 70 days in culture without G418 selection 
(approximately 80 cell divisions), at least 50% of the cells in clones 22-11 
expressed p-geo at levels detectable in this assay (Figure 8). In clone 22-6, 
approximately 25% of the cells had detectable p-geo activity after 70 days in the 
absence of selection (data not shown). Expression in the other clones could not 
be evaluated due to the insensitivity of this assay and due to the lower expression 
of p-geo in these cells. 
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Discussion 

The results show that naked DNA can be transfected into mammalian cells 
and, without integrating into an endogenous chromosome, form a functional 
synthetic chromosome. These synthetic chromosomes have many of the 
ciiaractenstics of normal mammalian chromosomes. First, they are present in the 
cell at low copy number, usually one per cell. Second, the synthetic chromosomes 
appear to be linear and contain telomeric sequences. Third, during mitosis, 
CENP-E is bound to the synthetic chromosome indicating the formation of a 
fiuictional kinetochore. Fourth, the synthetic chromosomes are mitotically stable 
in the absence of selection. Fifth, the synthetic chromosomes are capable of 
harboring transcriptionally active genes. Finally, the synthetic chromosomes are 
structurally stable over time, with an undetectable mtegration frequency. Unlike 
normal human chromosomes, the synthetic chromosomes are small and easily 
manipulated allowing different genes to be expressed in a variety of chromosomal 
contexts. 

The results show in vitro methods for produdng alpha satellite arrays up 
to 736 kb in length. In addition to prowding an essential component to tiie 
synthetic chromosomes described here, these results demonstrate that alpha 
satellite can be produced in clinically useful quantities. Previously, there has been 
no method available allowing structurally intact alpha satellite DNA greater than 
200 kb to be purified in the quantities necessary for the transfection of mammalian 
cells. 

As a control, the inventors recreated the previous failed experiments of 
Haaf aL, creating a chromosome with the concomitant integration of a satelKte 
DNA into an endogenous chromosome. The transfection in which this occurred 
lacked additional genomic DNA sequences. Without genomic sequences, it is very 
lik^ tiiat the chromosome fi>rmed as a result of a breakage event from one of the 
endogenous chromosomes that contain integrated alpha satellite DNA. In addition 
to being an inefricient and infrequent event, this approach is not useM for gene 
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therapy procedures due to the risk of inducing genomic rearrangement and 
malignant transformation in the host cell as a result of the chromosome breakage 
mechanism. 

In summary, the inventors have demonstrated that mitotically stable 
synthetic diromosomes can be created by transfecting large alpha satellite arrays, 
tek>m^c DNA, and genonuc DNA together into a human cell. Although each of 
these components appears to be necessary for efiScient synthetic chromosome 
formation, it is possible that genomic rearrangements foUovwng integration of 
alpha satellite DNA can lead to chromosome formation in the absence of genomic 
(non-alpha satellite, non-telomeric) sequences. On the other hand, alpha satellite 
DNA q)pears to be absolutdy required to produce these synthetic chromosomes. 
Her^ by creating synthetic diromosomes using alpha satellite derived from the Y 
chromosome and from chromosome 17, the mventors have dmonstrated that the 
source of alpha satellite DNA is not important. In other words, alpha satellite 
DNA from any chromosome can be used to create synthetic chromosomes. 
Furthermore, given that human chromosomes are stable in a variety of hybrids, 
includhig mouse, hamster, and primate, alpha satellite-like sequences from these 
other species can also be used to create synthetic chromosomes such as those 
described h^e. 



Table 1 





Clone 


Ya 


17a 


hg 


tel 


VK75 


Characteristics 




CI 0-7 


XX 






X 




Integrant (mid-Q of a large 
chroinosoine) 




ClO-10 


XX 






X 




Integrant (niid arm-medium size 
chromosome) 


5 


CI 1-7 




X 




X 




Integrated (forms a 17p+) 




Cll-19 




X 




X 




no signal 




C12-2 




XX 




X 




no signal 




C12-3 




XX 




X 




no signal 




C12-5 




XX 




X 




Integrated (mid arm) 




C12-6 




XX 




X 




Integrated (non-telomeric) 




C12-14 








X 




no signal 




C12-16 












no signal 




C13-1 








X 


X 


Integrated 




C14-1 


X 






X 


X 


Telomere directed truncation 


-15 


C15-2 


X 






X 




Chr. 6 truncation' de novo telomere 




Ci5-3 


X 






X 




no signal 




C15-4 


X 






X 




Integrated (p arm of a 16 like 

chrom) 




C15-5 


X 






X 




no signal 




C15-10 


X 






X 




Integrated into a telocentric 
chromosome 


20 


C15-12 


X 






X 




Integrated into chrom 1 7 below 
centromere 




C15-13 


X 






X 




no signal 




C15-21 


X 






X 




no signal 




C16-6 


XX 






X 




Telomeric/telocentric p-constriction 




C16-7 


XX 






X 




no signal 


25 


C17-2 




X 




X 




no signal 




C17-8 




X 




X 




no signal 




C17-11 




X 




X 




truncation of C group chromosome 




C17-15 




X 




X 




microchromosome, truncation of C 
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Clone 


Ya 


17c 


hg 


tel 


VK75 


Characteristics 


C17-19 




X 




X 




no signal 


C19-1 


X 




X 


X 




no signal 


C19-2 


X 




X 


X 




Ambiguous (telomere directed 
truncation?) 


C21-1 






X 


X 


X 


no signal 


C22-2 


(XX) 




X 


X 




Chr. truncation (de novo telemere 
@19p) 


C22-3 


(XX) 




X 


X 




no signal 


C22-4 


(XX) 




X 


X 




Telomeric/Tossible dicentric 


C22-5 


(XX) 




X 


■X 




Ambiguous; very small micro? 


C22-6 


(XX) 




X 


X 




Double micro (multiple micro) 


C22-7 


(XX) 




X 


X 




Large micro 


C22-8 


(XX) 




X 


X 




Ambiguous (possible micro) 


C22-9 


(XX) 




X 


X 




Telomeric/large array 


C22-11 


(XX) 




X 


X 




Small micaro 


C22-13 


(XX) 




X 


X 




Large micro 


C23-1 




(XX) 


X 


X 




Small mioio 



Table 1. Results from the trmisfection of various combinations of alpha satellite DNA, 
tdomaic DNA, and human genomic DNA into human cells. Ya and I7a are abbreviations 
for alpha satellite from the Y chromosome and chromosome 17, respectively, hg is an 
abbreviation for human genomic DNA that was digested vnihNotL Tel is an abbreviation 
for telom^c DNA. VK75 is an abbreviation for a 75 kb fragment from the X chromosome. 
X indicates that a sequence was included in the transfection. XX indicates that additional 
purified alpha satellite DNA was included in the transfection, as described in the 
Esqperimental Procedures. (XX) indicates that additional alpha satellite DNA was preligated 
to the b-geo/alpha satellite construct prior to transfection, as desaibed in the Experimental 
Procedures. 
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